Skywork Reward Gemma 2 27B V0.2
A high-performance reward model built on the Gemma-2-27B architecture, trained using the purified Skywork-Reward-Preference-80K-v0.2 dataset, excelling in preference judgment in complex scenarios.
Large Language Model
Transformers